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SYSTEM FOR ELECTRONIC REPOSITORY OF DATA ENFORCING 
ACCESS CONTROL ON DATA SEARCH AND RETRIEVAL 



ABSTRACT OF THE DISCLOSURE 



When an electronic document is made available for review by other entities, it is often convenient 
to store the document in a repository or database managed by a third party. A system is provided 
in which the entities authorized by the originator of a document to access it in the data repository, 
are able to securely search for the document filed with a third party repository without having to trust 
the administrator of the repository to provide accurate information on their access privileges. The 
document originator, the repository administrator and all entities with access privileges to data in 
the repository have vault environments which are secure extensions of their respective work spaces. 
The vault of the document originator maintains an access control list (ACL) for each document it 
has deposited with the repository. The vault of each entity with access privileges to any documents 
in the repository maintains a capability list of the entity's access privileges to all of the documents 
stored in the repository. The entity itself maintains proof of the latest version of its capability list 
on its own desktop. When the ACL for a document is updated in the document originator's vault, 
the originator's vault determines what entities are affected by the change, and communicates the 
change to the affected entities* vaults. Each vault updates its own capability list, and at the first 
available opportunity, updates the proof of the latest version of the capability list on the entity's 
desktop. 
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SYSTEM FOR ELECTRONIC REPOSITORY OF DATA ENFORCING 
ACCESS CONTROL ON DATA SEARCH AND RETRIEVAL 



Field of the Invention 

5 The present invention is directed to the field of electronic data storage, and provides, in particular, 
a secure data repository and exchange system administered by a third party data custodian in which 
access control is enforced on data search and retrieval 

Background of the Invention 

10 Recent parallel advances in network communications and public key infrastructure ("PKI") 
technology have prompted businesses and institutions to begin to utilize electronic documentation 
for record-keeping and for transactions of all types. With improvements in transmission integrity 
and security, it can be confidently assumed that documents sent electronically over the Internet and 
other open networks will arrive intact and tamper-free. Database management systems coupled with 

1 5 modern computer memories capable of storing several gigabytes of data have made it practical for 
businesses and institutions to simply dispense with maintaining paper records whose bulk 
necessitates real estate costs. 



Typically, data originating in one entity may have to be transmitted to others for any number of 
20 reasons such as deposit, review, etc. The data elements could be of the form of unstructured 
document files or structured records, such as bank account and other financial information. Using 
the example of unstructured data, it may be necessary to forward a document from the originating 
system to other computers in the same system or to computers residing on different systems for the 
purposes of review. This could occur equally in a business situation (e.g., a proposal for a joint 
25 venture or complex bid tender) as in an institutional setting (e.g., a graduate thesis to be reviewed 
by faculty advisers prior to submission to a university thesis review committee). The document has 
been created electronically since this will facilitate revisions and additions (particularly if it is 
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lengthy) without having to re-type the entire document each time. 

Having the document in an electronic form also facilitates review of it because the document in this 
form is easily transmissible. Intended reviewers can discover that a document is available by 
5 searching the system once they are given access to the storage location of the document. 

There are a number of reasons, such as security, data integrity and system or network availability, 
why the document creator will not want to store a document locally if it meant giving access, behind 
its firewall, to third parties. These reasons are discussed in greater detail in our concurrently filed 
10 patent application titled "System for Electronic Repository of Data Enforcing Access Control on 
Data Retrieval" (IBM Docket No. CA998-030), which is commonly assigned and which application 
is incorporated herein by reference. 

Our concurrently-filed application is directed to a system in which the integrity and access to data 
1 5 stored in a repository is maintained independently of any actions of the third party administrator of 
the repository. 

The invention described in that application is very efficient for systems with large numbers of 
documents accessible to large numbers of users because the information on authorized user access 
20 to the documents is stored in a single, central location, that is, in the repository itself. Users obtain 
secure knowledge of their access to documents by a means external to the system. 

The present invention is a modification in which the system itself contains the information on 
authorized user access, which is also secure from any actions of the third party administrator of the 
25 repository. 
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Summary of the Invention 

Thus, it is an object of the present invention to provide an electronic document storage and 
exchange system in which the documents are physically stored in a repository administered by a 
third party, but which users can search to determine to which documents in the repository they have 
5 access. 

It is also an object of the invention to provide a system in which the integrity and access to 
information about authorized user access to the data stored in the repository is available through the 
system, but is not dependent on the actions of the third party repository administrator. 

10 

Accordingly, in one aspect, the present invention provides a secure system for searching electronic 
data files stored in a data repository system. The system includes a communications environment 
that houses a first agent program for a depositor computer of an electronic data file in the data 
repository system and a second agent program for a first user computer with access privileges to the 

1 5 electronic data file. A manifest for the electronic data file lists access controls for the electronic data 
file. The manifest is accessible to and maintained by the first agent program. The first user 
computer has a record of its access privileges to the electronic data file which is accessible to and 
maintained by the second agent program. When changes are made to the manifest affecting the first 
user computer's access privileges to the electronic data file, these changes are communicated from 

20 the first agent program to the second agent program so that the first user computer's record of its 
access privileges can be updated. The first agent program is also able to verify the first user 
computer's access privileges to the electronic data file before the electronic data file is released to 
the second agent program. 

25 According to another aspect, the invention provides a process for maintaining secure electronic data 
searching of an electronic data repository in a system having a manifest listing access controls for 
each electronic data file stored in the data repository and a record listing document access privileges 
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for each computer with access to electronic data stored in the repository. The process consists of 
updating a manifest for an electronic data file stored in the repository, identifying all computers with 
a change in access to the electronic data file effected by the update, communicating the change in 
access to all affected computers, updating the access privileges records of all affected computers, and 
5 communicating the updated access privilege records to the affected computers. 

In a further aspect, the invention provides a secure system for searching electronic data files stored 
in a data repository system that includes means for maintaining a manifest listing access controls for 
each electronic data file stored in the data repository system, means for restricting access to each 
10 manifest to a computer with deposit privileges, means for maintaining a record listing access 
privileges to the electronic data files associated with each computer with access privileges to at least 
one electronic data file in the data repository system, means for restricting access to each said record 
to the associated computer with access privileges, and means for updating the record associated with 
each computer affected by an access change in a manifest. 

15 

Media encoded with program code to effect the above-described system or processes is also provided 
in the invention. 

Brief Description of the Drawings 
20 Embodiments of the invention will now be described in detail in association with the accompanying 
drawings, in which: 

Figure 1 is a schematic diagram of a document repository system utilizing a third party custodian; 

25 Figure 2 is a schematic diagram, similar to Figure 1, showing a vault document repository system 
utilized in the preferred embodiment of the present invention; 
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Figure 3 is a flow diagram illustrating the process of document creation, according to the invention; 

Figure 4, consisting of Figures 4A and 4B, is a flow diagram illustrating the process of document 
retrieval according to the invention; 

5 

Figures 5 A and 5B are flow diagrams showing a process, according to the preferred embodiment of 
the invention, providing immutability of access control for document search and retrieval; and 

Figure 6 is a flow diagram illustrating a process for assigning ownership privileges over stored 
10 documents, according to the invention. 

Detailed Description of the Preferred Embodiments 

A conventional arrangement for a document repository system utilizing a third party custodian is 
illustrated in Figure 1. A document originator 100 can deposit documents via its connection 102 

15 with a remote document repository service 104, such as a database, administered by a third party. 
As the owner of the deposited documents, the document originator 100 can assign access to the 
documents. For example, the document originator may assign a business partner 106 to have a 
"read" privilege, which means that the assigned business partner is allowed to retrieve the document 
via its connection 108 to the document repository service 104, but cannot make changes to the 

20 document on deposit. 

In such conventional systems, the document deposited by the document originator 100 is normally 
not encrypted so that the business partner 1 06 will be able to review the document on demand. This 
is because there are problems associated with decrypting documents in the prior art. Document 
25 decryption requires access to the private key of the document originator 100. To give access to its 
private key, the document originator 100 must either make itself available online during all times 
that decryption might be requested in order to perform the decryption itself (the issue of system 
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availability), or must set up a scheme in advance to make its private key available directly to the 
business partner 106 or through a trusted proxy (not shown). 

U.S. Patent No. 5,491,750 of International Business Machines Corporation, is for "Method and 
5 Apparatus for Three-Party Entity Authentication and Key Distribution Using Message 
Authentication Codes". This patent describes a system which allows for the distribution of secret 
session-management keys shared by two or more communication partners after the communication 
partners have been authenticated through a trusted intermediary. However, the keys generated under 
this scheme, and others like it, are short-lived and intended to be used as little as absolutely 
10 necessary. It is not clear that such a scheme would be appropriate to provide securely transmit 
decryption keys between communication partners in a document review system, with a persistent 
document repository. 

Thus, in conventional systems where documents are deposited for a period of time and are not 
1 5 encrypted (Figure 1 ), the third party administrator of the repository service 1 04 must be trusted with 
maintaining the integrity of the document. 

The document repository system of the preferred embodiment of the present invention is built using 
the IBM Vault Registry product, the subject of U.S. Patent Application No. 980,022, titled "Secure 

20 Server and Method of Operation for a Distributed Information System", filed November 26, 1 997, 
assigned to IBM Corporation. U.S. Patent Application No. 980,022 is hereby incorporated herein 
by reference. The IBM Vault Registry product provides an enhanced web server environment that 
implements a secure extension, called a vault, of the client's environment. This system relies on the 
modern transmission technology described in the Background of the Invention, that electronic 

25 transmission of documents and other data will arrive intact and error free. Resources contained 
within a client's vault are available only when accessed from the client using strong authentication 
via certified public keys. Depending on the environment, access may be through the client's web 
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The information content of the vault is encrypted for privacy. Each vault on a server has a unique 
encryption key and mechanisms that inhibit access to the keys except through the trust path approved 
5 by the owner of the vault, such as through a browser. Programs that run within a vault are isolated 
by operating system services to ensure such programs; 

a) operate in a process with a system identity (a virtual logon) so that the identity is available to 
dependent processes without the possibility of alteration by a program operating in the vault; 
10 b) have access to the data content of the vault in which they are running - but to no other; 

c) are approved for running in the vault by the owner of the vault; and 

d) are signed to prevent tampering and "Trojan horse" attacks. 

Programs operating in a vault can deposit information in the same vault or other vaults having secure 
1 5 access to each other* s public keys. Normally, these vaults will be located on the same vault server, 
but can be on different vault servers with access to a common Certificate Authority to provide the 
public key information. In the context of a vault repository, "deposit" can mean different things. 
In one implementation, deposit can refer to encrypting the data in the encryption key of the target 
vault and signing the data in the signature key of the depositing vault Vault programs cannot 
20 directly access either encryption or signature keys. This is done through an API. Optionally, the 
"deposit" function can place information in a queue contained in the target vault. Another option 
provides a "return receipt" that ensures that the information was deposited and that a program in the 
target vault opened the data. All of these "deposit" functions provide a means to pass information 
among vaults in such a way that: 

25 

a) their origin process cannot be denied; 

b) their content cannot be viewed by those with the ability to inspect inter-process communication 



7 



CA 02256936 1998-12-23 



CA998-040 
buffers; and 

c) delivery is guaranteed. 

If an application does not choose to queue data to the target vault, it can elect to store the information 
5 in a file, database or use any other system services that can treat the data as an "opaque" item (e.g., 
serializing it for object persistence). This opaque information can be managed by standard system 
techniques for the purpose of backup and recovery. However, its content can only be decrypted by 
a program running in the context of the vault that owns it using the SecureDepositor application 
programming interfaces. 

10 

Using the IBM Vault Registry product, the preferred embodiment of the invention has been 
developed as illustrated schematically in Figure 2. 

As in the system of Figure 1, in the scheme illustrated in Figure 2 a document originator 200 can 
1 5 deposit documents via its connection 202 with a document repository service 204, and as the owner 
of the deposited documents, can assign levels of access to the documents to third parties 206, such 
as business partners, who gain access to the documents in the document repository service 204 via 
their own network connections 208. However, unlike the system described above, users of the 
document repository system are not obliged to trust the third party to maintain the integrity of 
20 documents filed in the repository. 

The document repository system 204 of the preferred embodiment comprises two components, an 
application server 210 and a vault controller 214. The application server (AS) is a program to 
administer the database repository 212, which may be on the same machine or may be remotely 
25 located on a closed network. The vault controller 214 includes a number of components, user vaults 
216, 218 which are assigned on an individual basis to document originators 200 and business 
partners 206, an AS vault 220 assigned to the application server 21 0, and a vault supervisor program 
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222. 

A user vault 216 or 218 can be accessed only by the user (document originator 200 or business 
partner 206) to whom the vault has been assigned, upon proper authentication. The individual vaults 
5 do not have direct access to the document database 212, access is through the AS vault 220 and the 
application server 210. 

The application server component 2 1 0 does not run on a trusted computing base, but can execute on 
any platform. The application server has a reciprocating component which runs in the AS vault 220 
1 0 assigned to it in the vault server 2 1 4. The AS vault 220 can communicate with the application server 
210, and through the application server, has access to the document database 212. 

Figure 3 is a flow diagram illustrating the process of document creation, according to the preferred 
embodiment of the invention. Using the IBM Vault Registry environment, a personal vault is 
15 notionally a secure extension of the vault owner's environment. Thus, the interaction between the 
process steps in Figure 3 are shown between the vaults of the document originator and the 
application server. 

When creating a document in the data repository, the document is first sent from the desktop of the 
20 user who created or originated it, to the user's (document originator's) personal vault (block 300), 
where the document is "signed" with the user vault's private signing key (block 302). 

An electronic signature of a data element is a guarantee, provided by the signatory, of the integrity 
of that data element. A signature may be computed by first computing the data element's digest. 
25 The digest is a relatively small structure (e.g. 128 bits for MD2 or MD5 digest) with specific 
properties to guarantee security. First, it is a one-way function, which means that given a digest, it 
is impossible to obtain the original document that produced it. In addition, given a digest, it is 
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impossible (or computationally infeasible) to find a second pre-image, which would have the same 
digest. The digest is also collision-resistant. This means that two different pre-images are highly 
unlikely to produce the same digest. 

5 The data element's digest is then encrypted with the user vault application's private signing key 
(block 304). In the preferred embodiment, both symmetric and public-key asymmetric cryptography 
technology are utilized. 

With public key cryptography, an application has two keys, a public key and a private key, referred 
10 to as a key pair. The private key is held locally by the application, and is discussed in further detail 
below. The public key is made available to all users, usually through a directory service, such as 
X.500 distributed directory. Public key distribution is well known in the art, and is not discussed 
in further detail in the present specification. 

1 5 When public key cryptography is used, a data element encrypted with the public key may only be 
decrypted with the corresponding private key. Similarly, a data element encrypted with the private 
key may only be decrypted with the public key. 

In symmetric key technology, a single key is used for both encryption and decryption. In current 
20 practice, encryption/decryption and key generation are much faster in symmetric key technology 
than with public-key asymmetric technology. 

Data is normally encrypted using a randomly generated symmetric key. Then, the symmetric key 
is itself encrypted using the user's public encryption key, and is stored with the document so that it 
25 becomes part of the document. 

Continuing with Figure 3, the encrypted document and the electronic signature are forwarded to the 
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application server's vault for filing in the document database (block 306). On receiving the 
encrypted document (block 308), the application running in the application server's vault notarizes 
the signature (block 310) by re-signing it with its own private signing key. 

5 The notarization of a signature, in an electronic context, means that a third party, which acts as a 
"notary", certifies the content of a signature. (All of the duties attending the legal office of notary, 
as conferred by a government authority, are not intended to be covered by the references in this 
specification to "notary" and "notarization".) In general, electronic notarization of a signature is 
done as an extra precaution, in order to prevent unauthorized modification of the signature at a later 

10 time. In the case of the present invention, notarization of the user's digital signature prevents the 
user from replacing or modifying the original document in the document repository. A check of the 
notarized signature associated with the document would reveal any inconsistencies. 

A notarized electronic signature contains two pieces of information, the originator's signature of the 
15 given data element and the notary's signature of the originator's signature. The notary's signature 
should be computed over the originator's signature and the current time stamp. 

The application running in the application server's vault then signs the document it has received 
(block 312). Because the data it receives from the document originator is encrypted, the application 

20 server actually has no knowledge of the contents of the document. Therefore, according to the 
invention, this second signature is computed over the encrypted document, and the originator's 
notarized signature. The application server's signature constitutes a non-repudiation receipt, 
providing proof to the document originator (depositor) that the repository service received the 
document. The creation of the document in the repository may then not be denied later by the 

25 repository service. 

The encrypted document, the document originator's notarized signature, and the non-repudiation 
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receipt are all stored in the application server's repository or the application database (block 314). 
The non-repudiation receipt is sent to the vault of the document originator (block 3 1 6). The vault 
of the document originator checks the correctness of the non-repudiation receipt (block 3 1 8) by 
verifying the signature of the encrypted document. The document originator's vault also checks the 
5 currency of the time stamp in the notarized signature (block 320). The tolerance for the time stamp 
is application dependent If either of these tests fail, an error message is returned to the AS vault 
(block 322) and logged in the system. If the receipt is correct and current, the application running 
in the user's vault returns the non-repudiation receipt to the originating user (block 324) to be cached 
locally for future reference, in the event proof is required that the document has been stored in the 
10 repository. 

It is possible that the document originator may sign and/or encrypt the document using his own 
proprietary technique, before submitting it to his vault for storage. However, the document 
repository is not sensitive to the content of the document being stored. Hence, an encrypted 
15 document will be re-signed and re-encrypted by the user's vault, as any other document would be 
handled. 

Figure 4 is a flow diagram illustrating the steps, according to the preferred embodiment of the 
invention, which must take place to permit a document to be retrieved by a requestor who has been 
20 authorized under a type of manifest maintained by the document originator for each document called 
an access control list (ACL). As in Figure 3, the process steps have been divided between the three 
actors, user, application server and requestor, on the basis that the personal vaults of each are 
notional secure extensions of their respective work spaces. 

25 Beginning with Figure 4A, the requestor makes a request to his vault application to retrieve a 
document from the application server repository (block 400), and the requestor's vault application, 
in turn, forwards the request for the document to the application server's vault (block 402). 

12 
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The application server's vault application receives the access request (block 404) and retrieves the 
encrypted document and the originator's notarized signature from the application database (block 
406). 

The application server's vault application sends the encrypted document and the notarized signature 
to the vault of the document originator. The application server's vault also sends the identity of the 
requester's vault to the originator's vault (block 408). 

The originator's vault checks that the requestor is authorized to retrieve the document (block 410). 
In the preferred embodiment, document access control is enabled through access control lists used 
to restrict document access only to authorized entities. An access control list (ACL) is associated 
with a document, and is stored and maintained in the document originator's vault as described below 
in association with Figures 5 A and 6. The ACL must be checked when a requestor sends a request 
to retrieve a document A requestor will only be given a copy of the document if it has authorized 
access. 

According to the preferred embodiment of the invention, capability lists can be employed to enable 
requesters to verify their access to documents in advance of making access requests. A capability 
list identifies all documents in a repository to which a particular user has access privileges. A 
requester's capability list is stored and maintained in its own vault. The requestor determines which 
documents it has access to simply by checking its list. The use and maintenance of capability lists 
are discussed in greater detail below in association with Figure 5B. 

If the requestor is not authorized to access the document, an error message is returned to the 
originator and is logged in the system (block 414). 
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Continuing with Figure 4B, if the requestor is authorized to receive the document, the originator's 
vault application decrypts the document (block 416) and verifies the notarized signature (block 41 8). 
Since the originator's original signature was computed over the unencrypted document contents, 
only those users with access to the document contents (i.e., with the originator's private key) are able 
5 to verify the signature. If the signature does not correspond with what the document originator has 
in its own files, then it will be clear that it is not the same version of the document as deposited, and 
the originator will return an error message to the application server (block 420). 

If the signature is verified, the originator forwards the decrypted document and the notarized 
10 signature to the requestor's vault (block 422). 

On receipt of the decrypted document, the requestor's vault application attempts to verify the 
originator's notarized signature (block 424). If the requestor is unable to verify it, an error message 
is returned to the originator and is logged in the system (block 426). 

15 

If the originator's notarized signature can be verified, the requestor's vault signs the notarized 
signature that it received with the document. This signature is computed over the notarized 
signature, as well as the current time stamp, and constitutes a non-repudiation receipt (block 428) 
proving that the requestor retrieved the document from the repository. The requestor's vault returns 

20 the decrypted document along with the non-repudiation receipt it has generated to the requestor's 
desktop (block 430). The requestors vault also forwards the non-repudiation receipt to the 
application server's vault (block 432). The application server verifies the signature of the requestor 
vault on the receipt (block 434). If the signature cannot be verified, an error message is returned to 
the originator and is logged in the system (block 436). If the signature can be verified, the 

25 application server vault stores the receipt in the application database (block 438) for later use if the 
application server must prove that the requestor did retrieve the document. 
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Immutability of Access Control for Document Retrieval 

As discussed above, in a data repository, there is a requirement for document access control. This 
means that only those users authorized by the document's owner, are able to view the document, and 
that document access permissions may only be modified by the document's owner (i.e., the document 
5 originator) and those other users that have been authorized by the document's owner to modify the 
given document's access control list. It is important to have assurance that even the repository 
administrator does not have the power to modify a document's access permissions without 
authorization from the document's owner. 

10 There are two different types of application requirements for the immutability of document access 
control. Document access must be checked: 

1) when a user performs a search to find all documents that it is authorized to view; and 

2) when a user performs an actual retrieval of a document. 

15 

All applications must enforce access control on document retrieval (access type 2 above). For this 
type of access, the repository must guarantee that the access control of a document cannot possibly 
be modified by an unauthorized user, such as a business competitor. 

20 However, in some applications, it is not essential for a user to be able to query the document 
repository for all documents that it is authorized to see. For example, this knowledge may be 
communicated off-line through business meetings or over the phone. In that case, the user already 
knows what documents it has access to, and therefore the requestor's knowledge of its own 
document access cannot be affected by actions of the repository. 

25 

A system that enforces immutability of access control only on document retrieval, and not on 
document search, is the subject matter of our concurrently filed application titled "System for 
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Electronic Repository of Data Enforcing Access Control on Data Retrieval" (IBM Docket No. 
CA998-030). In this system the access control information is stored in the database/repository of 
the application server. 

5 A stronger form of access control immutability, which could be used where users do not have 
independent information about their document access, involves both document search and document 
retrieval. For this requirement, the access control information cannot be stored in the application 
database. Instead, it is stored in the document owner's vault. This scheme is the subject matter of 
the present invention, and it is illustrated through the flow diagrams of Figures 5 and 6, and 
1 0 discussed below. 

In the preferred embodiment, each document has, associated with it, an access control list (ACL) 
which identifies the access authorization to the document for different users. In addition, each user 
in the system has a capability list which identifies all stored documents which the user does not own, 
15 but to which it has access. 

In order to guarantee immutability, each ACL is processed in the document owner's vault as 
illustrated in Figures 5A, and, in parallel, each capability list is processed in the associated user's 
vault as illustrated in Figure 5B. 

20 

Beginning with Figure 5 A, whenever an ACL has been updated (block 500), the document owner's 
vault determines which users are affected by the change (block 502), and a message identifying the 
type of change in access (access added, expanded or restricted) is deposited in the vault of each user 
whose access to the document has been modified (block 504). 

25 

Each ACL is associated with a version number and a time stamp of its latest modification. Thus, 
the document owner' s vault then increments the ACL ' s version number (block 5 06) and replaces the 
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old time stamp associated with it, with the most current time stamp (block 508). A token, which is 
intended to guarantee the immutability of the ACL, is created from the current version number and 
time stamp now associated with the ACL, and is signed by the document originator's vault (block 
510). The ACL is also signed by the document originator's vault (block 512). 

The ACL token is then forwarded to the vault of any user with authorized access to the document, 
for storage with the user's access application on its desktop (block 5 14) for future ACL verification. 
The signed token is forwarded to the document originator's desktop for storage (block 516). 
Because it is holding a copy of the signed token, the document originator becomes the final arbiter 
of whether the document ACL is up-to-date or not. 

When a business partner wants to retrieve a document, the AS vault application sends the encrypted 
document to the originator's vault as described above (block 408 of Figure 4A). In order to verify 
the requestor's authorization (block 412 of Figure 4A), the document originators vault simply 
checks in. the. verified ACL,, stored locally,, that the requester has. access to. the. iadicsted document, 
With this technique, nobody can modify the ACL stored in the application database, without that 
change being detected by the document originator's vault. 

As described above, each user which owns documents in the repository, keeps on its desktop the 
signed tokens of the correct version of each ACL. The ACL versions kept by the user's vault are 
verified by comparing the signed token stored on the user's desktop with the one stored in the user's 
vault. This comparison can be performed at different times; one good opportunity to verify the 
ACLs stored inside a user's vault is during logon, so that every time the user logs on, the user's 
ACLs are verified. 

If ACL verification fails, the user's vault application can automatically stop honoring all requests to 
retrieve the document protected by the ACL. This state of document inaccessibility would persist 
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until the user either creates a new ACL, or re-certifies the existing ACL. The process of re- 
certification of the existing ACL would include synchronizing the ACL token stored in the user's 
vault with the token stored on the user's desktop. 

5 Whenever an ACL is updated, a number of steps take place in parallel to those set out in Figure 5 A. 
These additional steps are illustrated in Figure 5B. 

Each user's vault is responsible for maintaining a capability list, which contains a list of all the 
documents to which the user has access. The currency of the capability list itself is identified with 

10 a version number and last time stamp. This, when a message indicating a modification in a user's 
capability to access a document (an update to a document ACL) arrives in the user's vault (block 
520), the capability list in the user's vault is automatically updated by version number (block 522) 
and latest time stamp (block 524). A token is computed over the version number and time stamp 
(block 526) which can be used to verify the correctness of the capability list. The token is signed 

15 by the user's vault (block 528), as is the capability list (block 530). The signed token and capability 
list are both stored in the user's vault (block 532), but the user's vault preserves the old capability 
list and its associated token, as the token for the old capability list corresponds with the token stored 
on the user's desktop until an update can be made. 

20 One way to synchronize the current capability list with the corresponding user's token stored on the 
desktop is to do so automatically when the user logs on (block 532). The correctness of the token 
in the user's desktop can be checked against the old token preserved in the user's vault, and then the 
updated token sent to the user's desktop (block 534). Once the old token has been replaced on the 
user's desktop, the old capability list and its associated token can be deleted from the user's vault. 

25 

Another alternative for updating the capability list token on the user's desktop (not shown) is to 
require the user to take the initiative to review updates to the capability list since its last logon. 
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To assure the correspondence between ACLs and the capability lists, the environment on which the 
system is based (e.g., the IBM Vault Registry product) must provide guaranteed message delivery 
for messages deposited from one vault to another. The guaranteeing of the delivery of a capability 
update may also be done by the application, by for example requiring a receipt from the user who 
5 receives the update. 

As a result of this scheme, the ACL and the capability list are all stored by their respective owners. 
No party in the system can possibly change the access control list of a document, without the 
document owner becoming aware of the change. Furthermore, no party in the system can change 
10 a user's knowledge of access to a document (i.e., a capability) without the authorized user become 
aware of the change. 

In contrast to the access control scheme described in our above-identified co-pending application in 
which searches are done by the application server's vault, with the present invention, searches for 
15 documents a user is authorized to access are done by the user's own vault application. 

A ssignment of Ownership Privileges 

Some environments require that a document owner be able to allow someone else to modify the 
access list of the given document. For example, if the owner is not available, another authorized user 
20 will have the ability to update the given document's access control. 

According to a preferred embodiment of the invention, the updating of ACLs or capability lists can 
be performed by other users in the system following the steps set out in Figure 6. 

25 For example, when an ACL update is attempted, the updating user must be able to present the up-to- 
date signed token for the ACL (block 600). The signed token is sent to the user's own vault (block 
602) which passes the signed token to the originator's vault (block 604). If the updating user does 
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not have ownership privileges assigned in the ACL of this document, then the document owner's 
vault will detect this, refuse the update and return an error message to the user's vault (blocks 606, 
608). 

5 If the originator's vault can verify the signing user's access to the document, and that the version 
number, and time stamp of the AGL token are current (block 606), the ACL is updated (block 6 1 0) 
and a new token is generated and signed (block 612), and stored in the originator's vault (block 714). 
The new signed token is sent to the updater ! s vault (block 616). The updater's vault returns the new 
token to the updater's desktop for storage (block 618). The new signed token may also, optionally, 
10 be forwarded to the application server's vault for storage in the repository (block 620). 

This procedure requires that only one person may perform an ACL update at any given time. For 
example, if a document owner John will be leaving for vacation, he can permit a co-worker, Mary, 
to update the ACL of his document in his absence from the office by giving Mary his up-to-date 
15 token for the ACL of the document. Mary then issues an ACL update by presenting the token 
through her vault to John's vault. Mary receives the new signed token for the ACL which she 
returns to John on his return to the office. After installing the new token, John can issue his own 
ACL update. 

20 Data Backup and Recovery 

Occasionally it may be necessary for the administrator of the document repository to restore the 
document database from a previous backup. This may be necessary, for example, in case of a 
catastrophic database failure, such as a disk crash. 

25 The data that needs to be included in a backup are the documents themselves, the ACLs (whether 
stored in the application database or in the owner's vaults), the capability lists (for the systems the 
implement them, as described above), and the verification tokens of ACLs and capability lists. 
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Following a data r storation, updat s made after the most recent backup may be Lost. For the 
purposes of the present invention, this could include ACL and capability list updates. When this 
occurs, the verification tokens stored on user desktops may not correspond with the tokens in the 
corresponding vaults, denying users proper access. 

5 

Therefore, the following system has been implemented to provide a standard for data restoration in 
different situations. It is assumed that the backup was taken at TTME1, while the restoration 
occurred at a later TIME2. 

10 Where the a complete data restore of the document database, ACLs, capability lists and 
corresponding tokens stored in the vaults is performed, users authorized to access a document before 
TTME1 , can also access it after HME2. This means that if a user was authorized before TIME1 , but 
the authority was revoked after TIME 1 but before TIME2, the user will have document access until 
the document owner performs a check of the ACL token. Ail users should therefore do an ACL and 

15 capability list check following a complete data restoration. 

Where only the document database was restored, while the ACLs, the capability lists and the vault- 
stored tokens are untouched, users may find that they are authorized to access a document which 
does not exist in the database because the document has been added after TIME1, but subsequently 
20 lost in the database restore. Since all tokens are up-to-date, no other anomalies will occur. 

Another case involves a system in which capability lists are not used, but the ACLs are stored in the 
application database. Where the document database and the ACL have been restored, while the 
vault-stored tokens have not, users will find that all documents whose ACL changed after TIME1 
25 are inaccessible. This is because the ACL tokens in the application database do not match the tokens 
stored in the individual owners' vaults. To deal with this, all document owners will have to update 
the ACLs. One way to do this is for the administrator to send the old ACLs (which were in effect 
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at TIME1) to document owners, and ask them to re-install the corresponding tokens in their vaults. 
This update would be manual, not automatic, and an owner's documents are inaccessible until the 
owner has performed the update. 

5 For situations where database inconsistencies must be avoided, the repository administrator may 
disable access to all documents after a restore, until the originator takes corrective action. This 
disabling of access may apply to all documents stored in the repository, or to only a subset of the 
documents, whose consistency is the most critical. In this case, the repository administrator must 
be relied upon to preserve consistency of the system. However, as described above, the 
10 administrator in any case does not have the power to grant or revoke users' access to a document. 

Preferred embodiments of the invention implemented by means of the IBM Vault Registry product 
have been described. However, it will be obvious to the person skilled in the art that the present 
invention could be implemented using other products that provide similar functions, such as secure 
1 5 vault-like environments located locally with each user's desktop. Modifications of this nature and 
others that would be obvious to the person skilled in the art are intended to be covered by the scope 
of the appended claims. 
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THE EMBODIMENTS OF THE INVENTION IN WHICH AN EXCLUSIVE PROPERTY 
OR PRIVILEGE IS CLAIMED ARE DEFINED AS FOLLOWS: 



1 1. A secure system for searching electronic data files stored in a data repository system, 

2 comprising: 

3 a communications environment housing 

4 (i) a first agent program for a depositor computer of an electronic data file in the 

5 data repository system and 

6 (ii) a second agent program for a first user computer with access privileges to the 

7 electronic data file; 

8 a manifest for the electronic data file listing access controls for the electronic data file, the 

9 manifest being accessible to and maintained by the first agent program; 

1 0 a first record of the first user computer's access privileges to the electronic data file, the first 

1 1 . record being accessible to and maintained by the second agent program; 

12 means to communicate changes to the manifest affecting the first user computer's access 

13 privileges to the electronic data file from the first agent program to the second agent program for 

14 updating the first record; and 

1 5 means for the first agent program to verify the first user computer's access privileges to the 

16 electronic data file before the electronic data file is released to the second agent program. 

1 2. The secure system, according to claim 1, wherein the first agent program is a secure 

2 extension of the depositor computer and the second agent program is a secure extension of the first 

3 user computer. 

1 3. The secure system, according to claim 2, further comprising means for communicating the 

2 changes to the manifest affecting the first user computer' s access privileges to the electronic data file 

3 from the second agent program to the first user computer. 
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1 4. The secure system, according to claim 1 or 2, further comprising: 

2 a third agent program for a second user computer with access privileges to the 

3 electronic data file; and 

4 a second record of the second user computer's access privileges to the electronic data file, 

5 the record being accessible to and maintained by the third agent program, 

6 and wherein the means to communicate changes to the manifest affecting the first user computer's 

7 access privileges to the electronic data file from the first agent program to the second agent program 

8 for updating the first record, comprises means to communicate changes to the manifest affecting the 

9 second user computer's access privileges to the electronic data file from the first agent program to 

10 the third agent program for updating the second record; and 

1 1 wherein the means for the first agent program to verify the first user computer's access privileges 

12 to the electronic data file before the electronic data file is released to the second agent program, 

13 comprises means for the first agent program to verify the second user computer's access privileges 

14 to the electronic data file before the electronic data file is released to the third agent program. 

1 5. The secure system, according to claim 4, wherein the third agent program is a secure 

2 extension of the second user computer. 

1 6. The secure system, according to claim 5, further comprising means for communicating the 

2 changes to the manifest affecting the second user computer's access privileges to the electronic data 

3 file from the third agent program to the second user computer. 

1 7, The secure system, according to claim 2 or 5, wherein the communications environment 

2 comprises a server. 

1 8. The secure system, according to claim 1 , 2 or 5, further comprising an interface to the data 
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2 repository system housed in the communications environment, the interface adapted to receive all 

3 communications to and from the data repository system and the agent programs. 

1 9. The secure system, according to claim 8, wherein the interface is a secure extension of the 

2 data repository system. 

1 10. A process for maintaining a secure electronic data search system for an electronic data 

2 repository, the system having a manifest listing access controls for each electronic data file stored 

3 in the data repository and a record listing document access privileges for each computer with access 

4 to electronic data stored in the repository, the process comprising the steps of: 

5 updating a manifest for an electronic data file stored in the repository; 

6 identifying all computers with a change in access to the electronic data file effected by the 

7 update; 

8 communicating the change in access to all affected computers; 

9 updating the access privileges records of all affected computers; and 

10 communicating the updated access privilege records to the affected computers. 

1 11. A secure system for searching electronic data files stored in a data repository system, 

2 comprising: 

3 means for maintaining a manifest listing access controls for each electronic data file stored 

4 in the data repository system; 

5 means for restricting access to each manifest to a computer with deposit privileges; 

6 means for maintaining a record listing access privileges to the electronic data files associated 

7 with each computer with access privileges to at least one electronic data file in the data repository 

8 system; 

9 means for restricting access to each said record to the associated computer with access 

10 privileges; and 
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1 1 means for updating the record associated with each computer affected by an access change 

12 in a manifest 

1 12. A computer-readable memory for storing the instructions for use in the execution in a 

2 computer of the process of claim 1 0. 



26 



CA 02256936 1998-12-23 

# 




Deposit Documents, 
Assign access control 




View documents 




FIGURE 1 
PRIOR ART 



CA 022S6936 1998-12-23 




FIGURE 2 



CA 0225693 



6 1998-12-23 



DOCUMENT ORIGINATOR'S VAULT 



300 

RECEIVE DOCUMENT FROM 
DOCUMENT ORIGINATOR'S DESKTOP 



302 

SIGN DOCUMENT USING PRIVATE KEY 



APPLICATION SERVER VAULT 



304 

ENCRYPT DOCUMENT 
USING PUBLIC KEY 



306 

FORWARD ENCRYPTED 
DOCUMENT AND 
ELECTRONIC SIGNATURE 
TO APPLICATION 
SERVER'S VAULT 



308 

RECEIVE ENCRYPTED 
DOCUMENT 




316 
RETURN 
NON-REPUDIATION 
RECEIPT AND 
NOTARIZED 
SIGNATURE TO 

DOCUMENT 
ORIGINATOR'S 
VAULT 



3 

NOT ARK 
ELECTRONIC 


10 

£E USER'S 

; SIGNATURE 






3 

PRODUCE NON 
RECEIPT ON 
ENCRYPTED 


L2 

-REPUDIATION 

NOTARIZED 

DOCUMENT 



314 

STORE ENCRYPTED 
DOCUMENT, NOTARIZED 
ELECTRONIC SIGNATURE 
AND NON-REPUDIATION 
RECEIPT IN REPOSITORY 



322 

RETURN ERROR 
MESSAGE TO 
APPLICATION SERVER 
VAULT 



RETURN NON-REPUDIATION 

RECEIPT TO DESKTOP OF 
DOCUMENT ORIOINATOR TO 
CACHE LOCALLY 



FIGURE 3 



CA 02256936 1998-12-23 




CA 02256936 1998-12-23 




CA 02256936 1998-12-23 



FIGURE 5A 



502 

DETERMINE USERS AFFECTED BY UPDATE 









504 

FORWARD MESSAGE OF ACL UPDATE TO VAULTS OF AFFECTED USERS 












506 

INCREMENT ACL VERSION NUMBER 





, 1 . 

508 

REPLACE TIME STAMP WITH CURRENT ONE 

i 

510 

CREATE AND SIGN TOKEN OF ACL VERSION NUMBER AND TIME STAMP 



512 
SIGN ACL 



514 

FORWARD TOKEN TO AUTHORIZED USERS' VAULTS FOR 
STORAGE ON USER DESKTOPS 

1 

516 

FORWARD SIGNED TOKEN TO DOCUMENT ORIGINATOR'S 
DESKTOP FOR STORAGE 




CA 02256936 1998-12-23 



520 

RECEIVE MESSAGE OF ACL UPDATE FROM 
DOCUMENT OWNER'S VAULT 



522 

INCREMENT VERSION NUMBER OF CAPABILITY 
LIST STORED IN USER'S VAULT 



524 

REPLACE TIME STAMP IN CAPABILITY LIST WITH CURRENT ONE 



526 

COMPUTE NEW TOKEN FOR CAPABILITY LIST 



528 

SIGN TOKEN 



530 

SIGN UPDATED CAPABILITY LIST 




534 

UPDATE TOKEN STORED ON USER'S DESKTOP 



536 

DELETE OLD CAPABILITY LIST AND ITS 
ASSOCIATED TOKEN IN THE USER'S VAULT 



FIGURE 5B 



CA 02256936 1998-12-23 

• 



IJPDATINCi USER'S VAULT 



ORIflTNATQR'S vault 



600 

UPDATING USER PRESENTS 
SIGNED TOKEN FOR ACL 



602 

FORWARD SIGNED TOKEN 
TO OWN VAULT 



604 

FORWARD SIGNED TOKEN TO 
ORIGINATOR'S VAULT 



608 

RETURN ERROR MESSAGE 
TO USER'S VAULT 



NO 



NO 




612 
UPDATE ACL 



614 

CREATE AND SIGN NE 1 
VERSION NUMBER Al 


W TOKEN OF ACL 
MD TIME STAMP 






616 

STORE NEW SIGNED T 


OKEN IN VAULT 



618 

FORWARD NEW SIGNED TOKEN 
TO USER'S VAULT 



620 

FORWARD NEW SIGNED TOKEN TO 
UPDATING USER'S DESKTOP 



FIGURE 6 



